Structured Depth Prediction in Challenging Monocular Video Sequences

Authors

Miaomiao Liu

Mathieu Salzmann

Xuming He

Abstract

In this paper, we tackle the problem of estimating the depth of a scene from a monocular video sequence. In particular, we handle challenging scenarios, such as non-translational camera motion and dynamic scenes, where traditional structure from motion and motion stereo methods do not apply. To this end, we first study the problem of depth estimation from a single image. In this context, we exploit the availability of a pool of images for which the depth is known, and formulate monocular depth estimation as a discrete-continuous optimization problem, where the continuous variables encode the depth of the superpixels in the input image, and the discrete ones represent relationships between neighboring superpixels. The solution to this discrete-continuous optimization problem is obtained by performing inference in a graphical model using particle belief propagation. To handle video sequences, we then extend our single image model to a two-frame one that naturally encodes short-range temporal consistency and inherently handles dynamic objects. Based on the prediction of this model, we then introduce a fully-connected pairwise CRF that accounts for longer range spatio-temporal interactions throughout a video. We demonstrate the effectiveness of our model in both the indoor and

Similar resources

Fast Intra Mode Decision for Depth Map coding in 3D-HEVC Standard

Three-dimensional high efficiency video coding (3D-HEVC) is the expanded version of the latest video compression standard, namely high efficiency video coding (HEVC), which is used to compress 3D videos. 3D videos include texture video and depth maps. Since the statistical characteristics of depth maps are different from those of texture videos, new tools have been added to the HEVC standard fo...

Covariance Scaled Sampling for Monocular 3D Body Tracking

We present a method for recovering 3D human body motion from monocular video sequences using robust image matching, joint limits and non-self-intersection constraints, and a new sample-and-refine search strategy guided by rescaled cost-function covariances. Monocular 3D body tracking is challenging: for reliable tracking at least 30 joint parameters need to be estimated, subject to highly nonlin...

Depthless Streaming of Depth-based 3d Videos

In this brief ongoing research paper, we summarize our current work on reconstructing the depth map from a fusion of multiple estimated depth maps that are generated from multiple monocular cues. We first analyze a ground truth depth map to extract a set of depth cues or statistics. Then, using these depth cues, we process the colored reference video and generate an estimate of the...

Variational methods for dense depth reconstruction from monocular and binocular video sequences

ℋC-search for structured prediction in computer vision

The mainstream approach to structured prediction problems in computer vision is to learn an energy function such that the solution minimizes that function. At prediction time, this approach must solve an often-challenging optimization problem. Search-based methods provide an alternative that has the potential to achieve higher performance. These methods learn to control a search procedure that ...

Journal:

CoRR

Volume: abs/1511.06070

Publication date: 2015
